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ABSTRACT 

The 'Standard European Vector Architecture' 
database (SEVA-DB, http://seva.cnb.csic.es) was 
conceived as a user-friendly, web-based resource 
and a material clone repository to assist in the 
choice of optimal plasmid vectors for de-cons- 
tructing and re-constructing complex prokaryotic 
phenotypes. The SEVA-DB adopts simple design 
concepts that facilitate the swapping of functional 
modules and the extension of genome engineering 
options to microorganisms beyond typical labora- 
tory strains. Under the SEVA standard, every DNA 
portion of the plasmid vectors is minimized, edited 
for flaws in their sequence and/or functionality, and 
endowed with physical connectivity through three 
inter-segment insulators that are flanked by fixed, 
rare restriction sites. Such a scaffold enables the 
exchangeability of multiple origins of replication 
and diverse antibiotic selection markers to shape a 
frame for their further combination with a large 
variety of cargo modules that can be used for 
varied end-applications. The core collection of 
constructs that are available at the SEVA-DB has 
been produced as a starting point for the further 
expansion of the formatted vector platform. We 
argue that adoption of the SEVA format can 
become a shortcut to fill the phenomenal gap 
between the existing power of DNA synthesis and 
the actual engineering of predictable and effica- 
cious bacteria. 



INTRODUCTION 

The failure of the early pioneers of recombinant DNA 
technologies to implement standards for the construction 
and nomenclature of plasmids (1) and other genetic tools 
has resulted in decades of chaotic development of cloning 
vectors and other molecular assets. While these circum- 
stances have not limited laboratory studies, the necessity 
of fixed formats for vector organization and designation 
has become increasingly apparent as we enter the era of 
Systems and Synthetic Biology (2,3). We are witnessing a 
fast transition from the handling of a few-at-a-time genes 
toward the implementation of complex genetic circuits 
(4,5) and, ultimately, engineering living cells to acquire 
new-to-nature properties (6). However, the biological 
functions and devices that are necessary for such engi- 
neering may not be characterized to a point that makes 
them usable and predictable (2). Finally, the vast majority 
of genetic engineering efforts are made in Escherichia coli. 
This is an excellent host for the physical assembly of DNA 
parts into devices and modules (6), but, often, this 
organism is not the optimal platform for the deployment 
of the designed properties in a biotechnological setting, 
e.g. a bioreactor or released to the environment (7,8). 
Many attempts to expand the vector toolbox beyond 
enteric bacteria have been made in the past decades (9- 
12) and include a large suite of broad-host-range plasmids 
(13), transposons and transposon-vectors (14-16). The 
latter two methods have been instrumental for the 
random chromosomal insertion of mobile DNA (e.g. 
systems based on TnJ; 15,17,18), site-specific targeting of 
heterologous DNA sequences (Tn7 and CTX; 19) or a 
combination of the two (7,20). While such vectors have 
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permitted the study and enhancement of many desirable 
properties of Gram-negative bacteria, their molecular 
architectures have been generally capricious and largely 
dependent on the immediate application for which they 
were created. 

The regrettable lack of format of tools that are 
developed in different laboratories for the same purpose 
most often impede data comparison and typically require 
re-cloning of the same sequences into available familial 
vectors, which becomes a veritable bottleneck for engi- 
neering of devices that involve parts from various origins. 
Furthermore, the functional components of the vectors 
(e.g. origin of replication or selection determinant) are 
often assembled within the naturally occurring segments 
(which may carry undesirable internal sites) and pasted 
into existing restriction sequences. Frequently, broad- 
host-range vectors, which are useful for their specified 
purposes, end up being unnecessarily large and 
burdened with useless, if not fastidious, DNA sequences. 
While all of these issues may be ultimately resolved by 
the synthesis of entire genomes d la carte (21), the im- 
mediate future still needs better molecular tools for the 
following: (i) exploration, analysis and de-construction 
of the functional space of existing bacteria; (ii) re- 
construction of the system with the same or a different 
connectivity of constituents for creating new properties 
and (iii) deployment of the features of interest in the 
most advantageous carrier. In this background, it 
comes as a surprise that the vast literature on cloning 
vectors and genetic tools for bacteria has drifted 
toward extreme diversification rather than standardiza- 
tion and only a few instances can be noted that 
overcome this state of affairs. One example is the prop- 
osition of the BioBrick standard (22), which allows for 
the successive incorporation of DNA parts that are 
retrieved from a formatted repository of functional se- 
quences to a cognate plasmid scaffold. This is a suitable 
platform for the recurrent physical assembly of DNA 
segments, but it does not tackle the issue of functionality 
or deployment of the constructs. A variant of the 
BioBrick approach is the BglBrick approach (23), 
which exploits the loss of Blgll and BamHI sites when 
DNA segments that are cleaved with either enzyme are 
ligated to each other in a fashion that allows for regen- 
eration of a usable Blgll sequence after each round of 
insertion. This method has been useful for assembling a 
large collection of valuable vectors with various combin- 
ations of replication origins, selection markers and ex- 
pression systems (24). Finally, one could entertain the 
assembly of plasmids d la carte by recombining 
relevant DNA sequences in vitro with suitable enzymatic 
cocktails (25,26). While all of these advances constitute 
steps in the right direction, they also ask for a breach 
from the more traditional cloning procedures, which may 
not fit the invested interests of a large community of 
existing users and available constructs. 

The database (DB) described below is an attempt to 
overcome these problems by creating a coherent 
platform of molecular tools that are subject to a concise, 
minimalist and standardized format and nomenclature. 
Importantly, these tools are compatible with old and 



new cloning and DNA assembly methods. The resulting 
set of vectors, which are available in the DB, is composed 
of a number of synthetic, interchangeable and reusable 
functional modules that include broad-host-range origins 
of replication, antibiotic markers, expression systems and 
reporter genes that are punctuated by terminators and 
gadget-insertion sites. The first collection of these 
formatted vectors was intended for analysis and reassem- 
bly of engineered phenotypes in a suite of Gram-negative 
bacteria. We advocate that what we call the 'Standard 
European Vector Architecture' (SEVA) may become a 
fundamental reference to speed up the ease of biosystems 
engineering beyond laboratory applications. 

DATABASE DESCRIPTION 

Database organization 

The SEVA-DB (http://seva.cnb.csic.es) is a resource for 
implementing a standard for the physical assembly of 
vector plasmids and their nomenclature. In addition, the 
DB serves as an index for the repository of functional 
sequences and the actual constructs that are available to 
the community. SEVA-DB was developed with an archi- 
tecture that consists of a relational DB as the data storage 
layer, a series of modules that are hosted by an application 
server and a web-based presentation layer with an explicit 
set of standards that apply to all constructs. The DB was 
designed to simplify the choice of a given vector for 
specific applications in such a way that the user can 
easily decide what is the best configuration of replication 
origins, antibiotic resistances and business segments, as 
explained below (Figure 1). Furthermore, the user is 
given instructions on how to order vectors in the collec- 
tion, contribute to the platform with new constructs, 
obtain a standard code for novel plasmids that follow 
the SEVA format and report problems. To this end, the 
tracking components include core data, clone-order data 
and user-management information. The core data include 
all relevant aspects of vector information such as origins 
of replication, antibiotic selection markers and cargo 
segments. The clones/vectors in the repository collection 
contain no inserts. Most clones are readily available; 
however, if mutations that are acquired during storage 
are detected, those clones may be temporarily removed 
from the active list of offered constructs, but their 
SEVA code and the link to their cognate DNA sequences 
to GenBank are maintained. 

The SEVA standard 

The SEVA platform involves the adoption of a plain set of 
rules for the physical assembly of the three basic compo- 
nents of plasmid vectors (i.e. origin of replication, selec- 
tion marker and cargo segment, see below), edition and 
synthesis of the corresponding DNA sequences to remove 
any restriction sites that punctuate the general frame and 
implementation of an alphanumeric code for the resulting 
genetic tools. The schematic representation of the archi- 
tecture of the SEVA vectors is shown in Figure 2a. To 
standardize the SEVA modules, a number of criteria 
were followed. First, each of the naturally occurring 
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Figure 1. Representative search results of the SEVA-DB. The welcome page includes a header with active links to the formatted plasmid structure, 
plasmid backbone, modules, nomenclature, plasmid list and contact information. Typically, the user will browse through the plasmid list, where the 
collection of constructs are ordered according to their origin of replication, antibiotic selection marker, type of cargo and a SEVA code. The page for 
each plasmid also has active links to the corresponding GenBank number and the complete DNA sequence of each construct. 



sequences that were destined for the various constructs 
were minimized to the shortest DNA segments that 
retained full functionality. The sequences were then 
erased of any of the restriction sites that were present in 
the polylinker of the classical vector pUC18 (27) and of 
Sfil, Avrll and NotI, but the encoded amino acids were 
retained by the mutated codons. The optimized fragments 
were produced by either site-directed mutagenesis of the 
DNA (28), as required, or were entirely chemical 
synthesized. Each of the three fragments was then 
assembled in a shared frame that was composed of three 
connecting parts. The plasmid vectors are, therefore, 
composed of six functional modules and the location 
and orientation of the three connectors provide the 
backbone of reference for the standardized vector collec- 
tion. The sequences that link the variable parts include the 
strong, r/20-independent transcriptional terminators T 0 of 
phage lambda (103 bp) and T. (105 bp) of the rrnB operon 
of is. coli (29). These terminators function by avoiding any 
transcriptional read-through into adjacent sequences, 
which increases plasmid stability (30). The third connect- 
ing element comprises a 246-bp DNA segment from the 



conjugative, broad-host-range plasmid RP4 (31) and is the 
plasmid's very efficient origin of transfer (oriT). This oriT 
allows for conjugative mobilization of pSEVA plasmids 
into organisms in which no alternative transformation 
protocols are available (32; see below). Note that while 
oriT enables constructs to engage in conjugative gene 
transfer, its addition to SEVA vectors does not increase 
significantly chances of unintended uptake by naturally 
occurring microorganisms, which is largely caused by ac- 
cidental DNA transformation. Because these three core 
sequences (T 0 , T. and oriT) are shared by all plasmids 
of the series, specific primer pairs (PS1-PS6, Figure 2b) 
were designed that hybridize to these regions and can be 
used to amplify any of the intervening segments. 

The first variable part of the pSEVA constructs consists 
of the antibiotic selection marker. As shown in Figure 2a, 
the DNA segments that carry such markers include the 
structural gene for one antibiotic (Ab) resistance gene 
and its native promoter. The unusual Swal and PshAI 
restriction sites flank the Ab resistance unit and the 
expression of the Ab gene is oriented toward the onT-con- 
necting module. The size of the DNA segment for such an 
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TTTAATTAAAGCQGATAACAAITTCACACAQSAGGCCGCCTAGGCCGCGGCCGCGCGAATTCGAGCTCGGTACCCGGGGATCC 



Pad Sfil EcoRI Kpnl BamHI 

Sail Sphl Notl 

90 100 110 120 130 140 150 

TCTAGAGTCGACCTGCAGGCATGCAAGCTTGCGGCCGCGTCGTGACTGGGAflAACCCTGGCGACTAGT 

Xbal Pstl Hindlll F24 Spel 

Figure 2. Overall organization of structure of SEVA plasmids. (a) 
SEVA vectors are formed by three variable modules: a cargo (blue), 
a replication origin (green) and an antibiotic marker (magenta). 
Enzymes used to change the functional DNA segments are shown in 
the same color code and modules are separated by three permanent 
regions, which are shared by all vectors, the T 0 and T[ transcriptional 
terminators and the oriT conjugation origin. All universal primers 
(PS1-PS6) are placed within the invariable backbone and are used to 
sequence/check the variable modules, (b) The structure of the default 
SEVA cargo. Cargos are cloned as Pacl/Spel fragments. The default 
segment contains the typical pUC18 polylinker enzymes from EcoRI to 
Hindlll (the completely ordered restriction enzyme list is highlighted in 
the figure). Additional enzymes (i.e. Sfil, Avrll and Notl) are placed 
outside of the polylinker for specific cloning purposes. Within the cargo 
sequence, the enzyme recognition sites and the hybridization site of the 
universal Ml 3 (R24/F24) primers are shown. 

Ab resistance unit ranges from 0.8 to 1.3 kb, depending on 
the specific marker (see below). The second exchangeable 
part of the SEVA vectors is a DNA segment that holds the 
origin of replication of the plasmid, which is produced and 
inserted in the plasmid frame as an Ascl-Fsel fragment. 
The composition of such segment that endow replication 
is more complex than the one-gene counterparts for anti- 
biotic selection because they include an oriV for replica- 
tion initiation and may encode specific replication proteins 
and their corresponding promoters. Although the orienta- 
tion of these sub-components may vary, the SEVA 
standard entails that the oriV sequence be proximal to 
the Ascl end of the DNA fragment and that transcription 
of the replication protein(s), if any, points toward the Fsel 
site of the oriT connector (Figure 2a). The 
broad-host-range replication origins that are selected to 
this end have different sizes (1.6-3.7 kb) and endow the 
plasmids with different copy numbers, which are decided 
by the user. Finally, the third variable element of the 
SEVA constructs is the DNA portion that bears the 
main functionality of the vector. We have designated 
this element as the 'cargo module' (31) and this module 
is always formatted as a Pacl-Spel fragment. This region 
confers a specific purpose to the plasmid, whether for 
cloning, expression of heterologous genes, creating 



reporter gene fusions or for chromosomal integration. 
Note that DNA fragments that are bordered by 
Pacl-Spel sites can also be cloned into the synthetic trans- 
poson vector of pBAMl (31,33), thus allowing for stable 
chromosomal integration of a DNA segment that is 
assembled in SEVA plasmids. Figure 2b shows the 
sequence of the default SEVA cargo, which consists of a 
basic polylinker that is formed by the array of unique 
cloning sites of pUC18 (27), Sfil/Avrll and Notl sites 
upstream of the EcoRI site and a second Notl site that 
is placed downstream of Hindlll site. These flanking Notl 
sequences allow concatenation of cloned fragments (posi- 
tioned between EcoRI and Hindlll) into derived SEVA 
vectors where Notl is unique (see below). This format in- 
creases the number of fragments that can be assembled 
into a SEVA cargo. The same principle is used for the 
Sfil site (18), but note that the Avrll site is used with 
Pad to implement various expression systems or other 
genetic devices, as explained in the next sections. The 
Ml 3 phage F24 and R24 primer sequences are present 
in the default cloning cargo to allow for verification of 
inserted fragments. Figure 2b also highlights the restric- 
tion site sequences and the positions of the Ml 3 primers. 

Nomenclature of the pSEVA plasmids 

In addition to the standardized layout of the constructs 
that is sketched in Figure 2a, the SEVA format calls for 
the designation of each of the vectors with the code that is 
summarized in Figure 3. Under this criterion, all vectors 
are named pSEVA followed by a multi-digit cipher. The 
first position of the cipher reflects the antibiotic resistance 
marker, and at the time of writing this paper, six had been 
used (see below). The second position of the code is for the 
origin of replication (five types thus far). Finally, the third 
spot indicates the more variable cargo module (eight types 
for now). Because a large number of cargos are possible, 
the third position can be extended with serial numbers and 
further qualified by using one or more upper/lower case 
Latin letters that act as a prearranged ID code. Finally, 
vectors with a genetic gadget that is inserted at the unique 
sites between the functional modules and the connector 
sequences are to be noted with a Greek letter that is 
placed before or after the position corresponding to the 
more proximal segment. The default pSEVAlll vector 
from which all others originate has an Ap R marker gene, 
the narrow-host-range origin of replication R6K and the 
default SEVA polylinker, as explained in the next sections. 

The pSEVA housekeeping components: antibiotic 
resistances and origins of replication 

The first series of standardized antibiotic markers comprise 
those that are commonly used for selection in Gram- 
negative bacteria. The set of generated cassettes included 
genes for resistance to ampicillin (Ap, acquired from the 
bla gene of pBAMl; 31), kanamycin (Km, the aphA gene 
was amplified from the pBAMl plasmid; 31), chloram- 
phenicol (Cm, the cat gene from the pCClFOS plasmid; 
Epicentre Biotech), streptomycin/spectinomycin (Sm/Sp: 
the aadA gene from pVLT35 plasmid; 10), tetracycline 
(Tc, the tetA gene from plasmid pBBRl-MCS3; 12) and 
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Figure 3. SEVA nomenclature. Plasmid vectors are named using digits 
that reflect their functional modules. The codes include at least three 
positions: the first stands for antibiotic markers, the second for the 
replication origin and the third for the cargo. The lists of possible 
modules for each position are given, different versions of cargos are 
possible, and additional digits and letters are used to specify the modi- 
fication (as shown in the lower right list). 



gentamicin (Gm, the aacCl gene from pBBRlMCS-5; 12). 
A complete description of the 'wet' standardization 
process that was performed for each exchangeable 
module will be published elsewhere (E. Martinez-Garcia 
et al., manuscript in preparation). 

The antibiotic markers were then combined into the 
SEVA frame in Figure 2a with a variety of origins of re- 
plication, i.e. narrow-host range and broad-host-range 
(Figure 4). In this article, we have focused on origins 
that are more commonly used and can be employed for 
manipulations of environmental, Gram-negative bacteria. 
One extreme of the series is the narrow-host range R6K 
origin. Replication of the R6K origin is dependent on the 
n protein (34); therefore, it can only be maintained in E. 
coli cells that express the pir gene in trans (35). This char- 
acteristic makes R6K very useful for genome-editing tech- 
niques, where chromosomal integration of a specific 
plasmid is required (31,36). The standardized oriV R6K 
includes a minimal 392-bp sequence that was excised from 
pBAMl (31) and combined with the default cargo and 
each of the antibiotic markers. The resulting R6K- 
vectors were then used as templates for replacing the rep- 
lication module with a number of broad-host-range 
(BHR) counterparts that were edited as indicated previ- 
ously and formatted in all cases as an Ascl-Fsel fragment 
(Figure 4). The architecture of these origins consists of a 
minimum sequence for replication initiation (oriV) and 
one or more replication proteins (rep genes). The collec- 
tion of reference constructs includes four of these (see 
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Figure 4. Organization of the edited SEVA plasmid replication origins. 
The R6K suicide origin lacks the gene encoding the n replication 
protein, which makes its maintenance dependent on the trans delivery 
of that protein. RK2 is formed by the vegetative origin (oriV) and the 
replication protein trfA. pBBRl has a similar structure to RK2; 
however, the pRO1600/ColEl origin is a hybrid of the pRO1600 
origin and the ColEl replication sequence. For RSF1010, the mob 
genes have been eliminated and the synthetic sequence has only the 
oriV and the rep B AC genes, which are expressed from a PlacUV5 
promoter. 



sketches of Figure 4). The first and smallest (1.52 kb) 
BHR origin comes from the medium-copy number and 
small broad-host-range plasmid pBBRl, which was first 
isolated from the Gram-negative bacterium Bordetella 
bronchiseptica (37,38). This module is the basis of a large 
number of vectors (12,39) and includes only an oriV and a 
replication protein. One downside of such a DNA segment 
is that its copy number may be disadvantageous for some 
applications, e.g. regulation studies, but it is an excellent 
and stable replicon for most purposes. The next-larger 
module contains a hybrid of the origins of the 
narrow-host-range E. coli plasmid ColEl (27) and the rep- 
lication determinants of pRO1600, which is plasmid from 
a Pseudomonas aeruginosa isolate whose functionality is 
limited to closely related species (40). The ColEl and 
pRO1600 origin combination is derived from vector 
pUCP24 (11), which was used to produce a 1.97-kb repli- 
cation module for the SEVA collection. Note that a 
plasmid with the ColEl oriV is found at a high copy 
number in E. coli, while the number of plasmids with 
the pRO1600 origin may vary from host to host (41). 
The third origin is the 2.22-kb replication origin of the 
broad-host-range IncPa plasmid RK2 (42). The 
standardized ori segment is formed by oriV followed by 
the gene that encodes the replication protein TrfA, which 
is expressed from its native BHR promoter (Figure 4). The 
immediate source of this RK2 origin was pJB785TT (43) 
and the relevant sequence of this origin was used for 
formatting the corresponding module. This origin of 
replication is among the least restrained and it keeps its 
copy number per cell very low (44). Finally, we formatted 
and edited a minimized version of the replication deter- 
minant of the IncQ, hyper-promiscuous replicon RSF1010 
(45). The host range of the IncQ plasmids includes the 
enterics, pseudomonads and their proteobacteria relatives 
and more distant species (i.e. firmicutes, actinomycetes, 
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cyanobacteria and acidophilic bacteria that are associated 
with mining environments). Not surprisingly, RSF1010 
has been extensively exploited since the onset of genetic 
engineering for developing a large number of plasmid 
vectors (10,46). However, the extant arrangement of the 
RSF1010 replication elements were not optimal for our 
purposes because the oriV and the cognate replication 
proteins are encoded in DNA segments that are 
intertwined with mobilization and transfer determinants 
(45). To overcome this shortcoming, we created a 
streamlined segment that only contained the RSF1010 
oriV and was positioned adjacent to the repBAC genes, 
which were expressed through an engineered Pi ac uvs 
promoter (Figure 4; 47). The resulting synthetic origin is 
only 3.7 kb, which is smaller than the 5.4-kb region of the 
naturally occurring replicon (48,49). This new origin splits 
the replication of plasmids containing this module (as well 
as any of the previously mentioned origins of replication) 
from their conjugal transfer abilities. This property is 
present by default in all SEVA constructs by virtue of 
the third Fsel-PshAI connector element of the plasmid 
backbone (Figure la), which bears the oriT of the BHR 
and self-transmissible RK2/RP4 plasmid that was men- 
tioned earlier. Note that the conjugation machinery of 
this plasmid is one of the most promiscuous because it 
can deliver DNA to other Gram-negative and Gram- 
positive species and even to yeast (50) and mammalian 
cells (51). Furthermore, efficient mobilization of 
RK2onT-containing plasmids can be brought about by 
transient expression of the corresponding tra/mob genes 
(32,52), which is a property that has been exploited for a 
long time to deliver exogenous DNA to hosts that are 
recalcitrant to straight transformation (53). 

The six antibiotic resistance markers were combined 
with the five, above-mentioned origins of replication in 
the vector frame depicted in Figure 2a that bears the 
default polylinker in Figure 2b. Doing so created a collec- 
tion of 30 reference vectors, which are listed and described 
in the SEVA-DB. Notice that their different replication 
machineries are unlike each other, thereby ensuring their 
compatibility in the same bacterial host if they are 
co-selected with the appropriate antibiotics. Notice also 
that the plasmids are neither endowed with specific repli- 
cation termination signals (54) nor with active segregation 
systems (55), which are specific for each plasmid and host 
in their natural settings. In the absence of these functions, 
cells containing SEVA plasmids are expected to resolve 
replication intermediates through the action of housekeep- 
ing topoisomerases (56) and to maintain the constructs 
through antibiotic selection. If this becomes a problem 
in specific hosts, the vectors can be improved for a 
given purpose by inserting toxin-antitoxin or counter- 
segregation systems (e.g. a parB cassette; 57) at the 
unique 'gadget sites' that were purposefully left available 
in the vector frame at the boundaries between the func- 
tional modules and the connecting elements (Figure 2). 
Note that the compatibility of different but compatible 
SEVA plasmids that encode diverse functionalities in the 
same cell expands the number and range of possible ap- 
plications of these vectors. 



Cargos for cloning and genome editing 

The 'cargo module' is the most important part of the 
SEVA vectors because it is where the user implants the 
sequences to be examined (e.g. regulators, promoters and 
other actuators). A number of different cargos have been 
initially set up for the starting vector collection. The 
SEVA standard states that the main transcription flow 
of whatever device is designed as a cargo should proceed 
from the Pad site toward the Spel site of the plasmid 
frame (Figure 2a). In this way, the intervening segment 
is transcriptionally insulated upstream by the Tj termin- 
ator, which is present in the first connecting sequence and 
downstream by the T 0 terminator. The repository of 
produced cargos includes four types of applications. 
First, 'cloning cargos' include (in addition to the default 
polylinker of the vector in Figure 2b) a SEVA-formatted 
variant of the widely used lacZct cloning platform for 
blue/white screening of transformed cells containing 
plasmids with the cloned fragments on X-gal plates (27). 
These cargos also afford the expression of heterologous 
genes from the endogenous Plac promoter when placed in 
the correct orientation. As shown the SEVA-DB, the 
pUC19/18 cloning cargos were combined with the 
pBBRl and pRO1600/ColEl origins of replication and 
all possible antibiotic markers, thus generating 24 BHR 
cloning vectors. A special collection of /acZa-containing 
plasmids with the narrow-host-range oriV R6K were se- 
parately developed for editing (i.e. inserting into, deleting 
portions of or replacing specific portions of) the genomes 
of Gram-negative bacteria (31). In this case, the cargo 
consisted of the lacZa polylinker of pUC19, which was 
flanked in-frame by two I-Scel-targeted sequences for 
site-specific genomic deletions. Such a formatting (and 
the resulting six additional vectors, see SEVA-DB) 
expands the utility of the original procedure (36) to 
bacteria that are naturally resistant to Km, which was 
the only selection marker that was available up to that 
point for the genome-editing method. 

Broad-host-range heterologous expression 

Other types of cargos were designed for the regulated 
expression of heterologous genes. Typical genetic devices 
to this end include one transcription factor (TF), the 
activity of which is triggered or inhibited by a specific 
inducer and a target promoter, upon which the regulator 
drives transcription of the desired gene(s). Two expression 
systems of this sort were formatted following the SEVA 
standard as part of the reference plasmid collection. The 
first system consists of plasmids containing the 
IPTG-inducible lacP-Ptrc system, which was excised 
from the pTrc99a vector (58) and assembled in the 
SEVA frame as a Pacl-Avrll fragment so that the 
regulated promoter points toward the Avrll site. In this 
way, users can benefit from the remaining downstream 
cloning sites of the default polylinker (Figure 2b). Note 
also that this expression cargo (as well as any other SEVA 
counterpart, see below) lacks translation signals (e.g. 
Shine-Dargarno ribosome binding sequences), which 
may need to be engineered on a case-by-case basis 
(59,60). A second broad-host-range expression system 
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includes the benzoate/m-toluate-responsive XylS regulator 
of the TOL plasmid of Pseudomonas putida mt-2 and its 
cognate promoter Pm. The system is configured like the 
pJP plasmid series (9), but the DNA was edited so that five 
restriction sites that were found in the original sequence 
were removed and the segment was generated as a 
Pacl-Avrll restriction fragment. The SEVA-DB displays 
a non-exhaustive list of pSEVA variants with each of the 
two expression cargos, which are merged with the R6K, 
RK2 or pBBRl origins of replication and different anti- 
biotic markers. The combination of the two expression 
modules with various plasmid copy numbers cover most 
transcriptional capacities and dynamic ranges (see below) 
that are usually required, but they can be easily replaced 
by other expression cargos and expression devices to 
satisfy particular specifications. One large source of regu- 
latory cargos is the many TFs that regulate the expression 
of biodegradative operons that encode xenobiotic and re- 
calcitrant compounds in soil bacteria (61,62). 

Parameterization of promoter output 

The last type of cargo that is available in the basic SEVA 
collection consists of a compilation of reporter genes for 
the visualization and quantitative measurement of expres- 
sion in time and space. Reporters allow for the parameter- 
ization of regulatory interactions (e.g. the characterization 
of the transfer function of a given regulator-promoter 
pair). In most cases, the reporter product is encoded 
within a promoterless cassette, which contains the corres- 
ponding ORF and is preceded by a RBS. When a reporter 
is placed downstream of a transcription initiation device 
(e.g. a regulated promoter), its readout reflects the 
intensity of the transcriptional output. In other cases, 
the coding sequence of the reporter product is fused 
upstream or downstream of another gene to generate a 
translational fusion. Under these conditions, the optical 
or enzymatic readout of the hybrid reports its transcrip- 
tion and translation. It follows that when transcription is 
constant, translational fusions report the actual produc- 
tion of the hybrid protein. These features have a large 
number of applications for characterizing riboswitches 
(63) and small RNAs (64). In this background, we have 
formatted some of the most common reporters as Hindlll/ 
Spel fragments so that they can be easily inserted down- 
stream of any transcription initiation device at the default 
SEVA polylinker. The reporter cargos include intact and 
truncated versions of the P-galactosidase gene of E. coli 
(lacZ, ~3 kb), which can be fused to another gene to make 
either transcriptional or translational fusions to this wide- 
spread indicator gene (65). This reporter is the only thus 
far whose activity can be expressed in absolute units (i.e. 
Miller units) and, despite its shortcomings (e.g. lengthy 
manipulations of cells, lack of an internal standard of refe- 
rence; 66), it is still a very useful asset for studies on 
promoters in virtually any biological host. Similarly, the 
collection also contains the complete 5-kb, promoterless 
luxCDABE operon from Photorhabdus luminescens that 
encodes all enzymatic activities for emission of light (16). 
This reporter can be followed without any disruption of 
the cells and quantified very accurately (65). The reaction 



that releases photons is dependent on the physiological 
status of cells, specifically on FMNH 2 , which is a 
cofactor that sharply declines as cells enter stationary 
phase (67). Still, the lux reporter is optimal for measuring 
promoter output in steady-state growing cells, which is a 
common circumstance in laboratory experiments. Finally, 
the reporter cargo collection has five promoterless genes 
that encode different fluorescent proteins (i.e. GFP, CFP, 
YFP, dsRed2 and mCherry), all of which are formatted 
following the Hindlll — >■ Spel standard of the SEVA 
frame. Because fluorescence is not diffusible, these re- 
porters provide extraordinary detail, such as stochasticity, 
phenotypic variation and transcriptional noise, on gene 
expression at the single-cell level (68,69). Furthermore, 
the same bacteria can be engineered to express fluorescent 
proteins of various colors that are borne by either the 
same vector or by compatible SEVA plasmids, thereby 
allowing the monitoring of different processes in 
individual cells. Note that each of the reporter genes of 
the SEVA-DB has been standardized to also have the 
same, optimized RBS sequence ( AGGAGG AAAAACA 
T) that was described for GFP of the pGreenTIR vector 
(70). The maintenance of the same architecture in the 
reporter cargos allows for comparison of results that are 
generated using different indicator genes and for the pro- 
duction, whenever needed, of conversion tables for differ- 
ent units of measurement. Ultimately, we expect these 
constructs to become instrumental for assigning the 
values of reporter readouts to absolute counts of RNA 
polymerase per second (PoPS counts; 65) of the regulatory 
node that is under investigation. Following the 
above-described numbering procedure, cognate plasmids 
are assigned the numeral 7 in the third position of the 
SEVA code because they are all variants of the original 
GFP protein, but the numeral is then followed by a letter 
to indicate the cargo: 7 for GFP, 7Y for YFP, 7C for CFP, 
7D for DsRed2 and 7R for mCherry. 

Sharing and distribution of SEVA plasmids 

Plasmids that are available in the SEVA-DB and those 
that will enrich the collection in the future are distributed 
worldwide to researchers at any academic or non-profit 
laboratory. No MTAs will be requested prior to 
shipping of the constructs to potential users of such 
categories. When utilizing the constructs for commercial 
or industrial purposes, solicitors should be aware that 
some of the DNA sequences that were included in the 
SEVA plasmids could be subject to intellectual property 
restrictions. In this regard, the SEVA platform disclaims 
any carrier liability. Contributors to the collection must 
adhere to such an open-access policy or they should not 
number their constructions with the SEVA nomenclature. 
Clones are distributed either as glycerol stocks, stabs in 
agar tubes or purified plasmid DNA and in keeping with 
accepted practices (71), the SEVA-DB will distribute pe- 
titions of a small number of clones at no cost to the user 
but may charge handling, maintenance and shipping fees 
for large requests. 

Users are encouraged to produce their own cargoes and 
other segments in the format that is advocated in this 
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Avrll Notl EcoRI 




Figure 5. Combination of expression and reporter cargos in the SEVA 
plasmids. (a) The structure of the default cloning cargo with the uni- 
versal Ml 3 primers (F24 and R24) and relevant restriction sites, (b) 
Assembling expression systems as Pacl/Avrll fragments removes the 
R24 primer sequence and Sfil sites upstream of the Avrll site, (c) 
Cloning of reporter genes as Hindlll/Spel fragments removes the F24 
primer sequence and a Notl site that is placed downstream of the 
Hindlll site, (d) Combination of an expression system and reporter 
gene leaves an intact polylinker (from EcoRI to Hindlll) and a 
unique Notl site between the Avrll and EcoRI sequences, which can 
be used to clone large constructs. 



article and communicate those sequences to the plasmid 
repository through the SEVA-DB web page for quality 
control. As mentioned earlier, each of the SEVA cons- 
tructs is allocated a core, multi-digit code that ciphers its 
physical organization. This code can be enriched with 
Latin and Greek letters, which consign variants of the 
default modules or genetic gadgets, to endow plasmids 
with specific properties. Contributions to the collection 
of SEVA vectors must be accredited by the curators of 
the platform to follow the above-specified standards, 
stored in the SEVA repository and assigned a definitive 
code prior to public release. Furthermore, users are 
discouraged from assigning SEVA codes to constructs 
that are not explicitly endorsed by the SEVA-DB 
managers. 



FUTURE PERSPECTIVES 

The structure of the SEVA plasmids allows for the 
straightforward combination of expression systems and 
reporter (or otherwise heterologous) genes such that any 
input (e.g. IPTG or benzoate) can be genetically wired to 
any output (lacZ, lux, GFPs) and the plasmid will still 
retain the intact multiple cloning site (from EcoRI to 
Hindlll) that can be used for introducing other combina- 
tions of TFs and promoters a la carte (the possible ways of 
connecting SEVA cargos are exemplified in Figure 5). This 
modularity and the compatibility of the various replicons 



allow for the assembly of complex circuits in the same host 
and the performance/behavior of each sub-circuit to be 
easily monitored. Still, we argue that the importance of 
the SEVA platform is the expansion of the contemporary 
Synthetic Biology toolbox for bacteria to actual applica- 
tions, e.g. for biocatalysis or bioremediation (72-75). To 
this end, the current SEVA-DB focuses on a number of 
broad-host-range replication origins and we have left 
typical E. coli replicons aside (22). 

Supplementary Table SI shows a non-exhaustive list of 
organisms where the current list of SEVA vectors have 
been tested and found to be functional (13). The RK2- 
and RSFlOlO-based vectors are, as expected, the most 
promiscuous. However, it is likely that the host range of 
the pBBRl-based and pRO1600-based vectors are under- 
stimated because they have been tried in only a small 
number of hosts. The modularity of the SEVA format 
allows for growth of the collection toward other replicons, 
such as those containing functional parts of Gram-positive 
organisms. The same holds true for the other functional 
segments, in particular, the cargo modules. Cargos 
are being expanded in our laboratory to include genetic 
mobile elements, new regulated-expression devices, 
systems for detecting/monitoring protein-protein inter- 
actions and various others. The community is encouraged 
to contribute to this effort to enrich the SEVA collection. 
New constructs will be entered in the SEVA-DB and made 
available as soon as they are verified to follow the 
standard. In summary, we advocate that adoption of the 
SEVA format will multiply the possibilities of pro- 
gramming a large number of bacteria beyond E. coli and 
will thus help to ease the transition between trial-and-error 
genetic engineering to systems-based Synthetic Biology 
(6,65,76). 



SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online: 
Supplementary Table 1. 
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